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1 Introduction 

This article gives an analysis of the NormalHedge algorithm in continuous time. The NormalHedge algorithm 
is described and analyzed in discrete time in [CFHJ. The continuous time analysis is mathematically cleaner, 
simpler and tighter than the discrete time analysis. 

To motivate the continuous time framework consider the problem of portfolio management. Suppose we 
are managing N different financial instruments allowed to define a desired distribution of our wealth among 
the instruments. We ignore the details of the buy and sell orders that have to be placed in order to reach 
the desired distribution, we also ignore issues that have to do with transaction costs, buy-sell spreads and 
the like. We assume that at each moment the buy and sell prices for a unit of a particular instrument are 
the same and that there are no transaction costs. 

Our goal is to find an algorithm for managing the portfolio distribution. In other words, we are looking 
for a mapping from past prices to a distribution over the instruments. As we are considering continuous time, 
the past can be arbitrarily close to the present. Formally speaking, we say that the portfolio distribution is 
"causal" or "unanticipating" to remove the possibly of defining a portfolio which is a function of the future 
gains as his would clearly be a cheat. We are interested in considering continuous time, because instrument 
prices can fluctuate very rapidly. 

To model this very rapid fluctuation we use a type of stochastic process called an Ito process to model 
the log of the price as a function of time. Intuitively, an Ito process is a linear combination of a differentiable 
process and white noise. A more formal definition is given below. To read more about Ito processes 
see |Osk03| . 

Our algorithm and its analysis do not make any additional assumption on the price movement of the 
instruments. Of course, with no additional assumption we cannot have any guarantees regarding our future 
wealth. For example, if the price of all of the instruments decreases at a particular moment by 10%, our 
wealth will necessary decrease by 10%, regardless of our wealth distribution. However, surprisingly enough, 
we can give a guarantee on the regret associated with our method without any additional assumptions. Regret 
quantifies the difference between our wealth at time t and the wealth we would have had if we invested all 
of our money in the best one of the N instruments. Specifically, denote the log price of instrument i at time 
t by XI and assume that the initial unit price for all instruments is one, i.e. Xq — log(l) = 0. Let Gt be the 
log of our wealth at time t. We define our regret at time t as 

Rt = max Xi — Gt ■ 

Intuitively, the regret is large if by investing all of our money in a particular instrument (whose identity is 
known only in hind-sight) we would have made much more money than what we actually have. The main 
result of this paper is an algorithm for which the regret is bounded by y^2ct{\nN + 1) where c is the amount 
of random fluctuations (white noise) in the instrument prices. It stands to reason that the bound depends on 
the number of instruments N, because the larger the potential impact of price fluctuations. If the instrument 
prices are are all simple random walks (brownian motion in continuous time) the expected price of the best 
instrument is proportional to y/ct \nN. 
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We can get potentially tighter bounds if we consider the set of best instruments. Suppose we sort 
for a particular time t from the largest to the smallest. We say that the e — quantile of the prices is the 
value X such that [eiVJ of are larger than x. We prove a bound of ■y/2ct(ln(l/e) + 1) on the regret of our 
algorithm relative to the e-quantile for any e > 0. This generalized bound can be used when we are hedging 
over an infinite, even uncountably infinite set of instruments. 

Case in point. Suppose that the instruments that we are combining are themselves portfolios. The set of 
fixed portfolios over > 1 instruments consists of the A*" — 1 dimensional simplex, which is an uncountably 
infinite set. Cover's universal porfolios algorithm |Cov91|, rC096| uses this set as the set of portfolios to be 
combined. We can apply our algorithm to this set and guarantee that our regret relative to the top e-quantile 
of fixed rebalanced portfolios is small. 

2 Hedging in Continuous time 

The portfolio management problem is defined as follows. Let X^ ior i — 1, . . . , N define the log-prices of 
N instrumentss as a function of time. The initial unit price is one, thus Xq = 0. X^ is an Ito process. 
More specifically, let dWt denote an N dimensional Wiener process where each coordinate is an independent 
process with unit variance. Then the differential of the total gain corresponding to the ith instrument is: 

N 

dXl = a\t)dt + S*'^ (t)dWi (1) 
i=i 

Where a*(f), 6*'-' (t) are adapted (non-anticipatory) stochastic processes that are Ito-integrable with respect 
to Wl . In mathematical finance terms correspond to price drift and 6*'-' correspond to diffusion or price 
volatility. 

The volatility of the ith instrument at time t is defined as 

N 2 

J=l 

and the maximal volitility at time t is defined as 

=maxl/'(i) (3) 

i 

An "aggregating strategy" is an portfolio management policy that defines how to distribute the wealth 
among the N instruments as a function of their past performance. Mathematically speaking, a policy is 
a stochastic process that is an adapted (non-anticipative) function of the past prices XI Given an 

■N 

instantiation of the stochastic processes Xl^^-^ the aggregating strategy defines N stochastic processes PI 
such that for all t > and ^ . P^ = 1. The cumulative gain of the master algorithm is defined to be at 
f = 0, for < > it is defined by the differential 

N 

dGt^Y^^tdXi 

The regret of the master algorithm relative to the ith instrument is defined to be zero at t = and is 
otherwise defined by the differential 

dRi = dXi - dGt 
And we can combine the last two equations to get: 

N 

Y^PidRl^O (4) 
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As Rf is a linear combination of XI it is also an Ito process and can be expressed as 



N 



dKl ^ a\t)dt + h''^{t)dWi 



where 



and 



N 



k=l 



N 



k=l 



Similarly to XI we define the diffusion rate of i?J to be 



N 



We prove an upper bound on V^{t) 
Lemma 1 



i=i 



(5) 



(6) 



(7) 



(8) 



Proof: We use b-' and the N dimensional vectors {b^'-' , ■ ■ ■ , b^'^) and (b^'^ , . . . , respectively.Using 
this notation we rewrite Equations ([7]) and ([5]) as 



N 



fc=i 



Equations ^ and ([5]) imply that ||b'(t)||2 < V^''{t) for all i. It follows that the norm of the convex 
combination is also bounded: 



N 



fe=i 



N 



2 fc=l 



From which it follows that 



N 



< 2V''' [t) 



3 Normalhedge 

NormalHedge is a particular aggregating strategy which is defined as follows. 
We define a potential function that depends on two variables, x and c: 



(t>{x, c) = 
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We will use the following partial derivatives of c): 



dx^ [ (x<0) ^ ^ ' ^ dx^^' 1 (x<0) 

and 

[ (x < 0) 

The NormalHedge strategy is defined by the following conditions that should hold for every t > 0. If 
i?J < for all 1 < i < then = Otherwise and c{t) are defined by the following equations. 



1 ^ 

-^0(i?j,cW)=e (9) 



1=1 



= /^^-^^.^^ (10) 



4 Analysis 



We introduce a new notion of regret. For a given time t we order the cumulative gains X'^ for i — I, . . . , N 
from highest to lowest and define the regret of the agregating strategy to the top e-quantile to be the difference 
between G(t) and the [eiVj-th element in the sorted list. 

Lemma 2 At any time t, the regret to the best instrument can he hounded as: 

maxi?i.f < yj2c{t){\nN + 1) 

i 

Moreover, for any < e < 1 and any t, the regret to the top e-quantile of instruments is at most 



v/2c(t)(ln(l/e) + l). 
Proof: The first part of the lemma follows from the fact that, for any i E Ef, 



which implies Rt,t < y/2c{t){\nN + 1). 

For the second part of the lemma, let Ri^t denote the regret of our algorithm to the instrument with the 
eA^-th highest price at time t. Then, the total potential of instruments with regrets greater than or equal to 
Ri t is at least: 

^"-(^)^"» 

from which the second part of the lemma follows. H 

We quote Ito's formula, as stated in [Osk03j (Theorem 4.2.1) 
Theorem 3 (Ito) Let 

dX{t) = udt + vdB{t) 

he an n-dimensional ltd process. Let g{t, x) — {gi{t, x), . . . , (?p(t, x)) he a map from [0, oo) x i?" into RP. 
The the process 

Y{t,Lo)^g{t,X{t)) 
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is again an ltd process, whose component number k, Y^, is given by 



dx,. 



2 ^ dxidxj 



{t, X)dXidXj 



where dBidBj = 5i_jdt, dBidt = dtdBi = 0. 

We now give the main theorem, which characterizes the rate of increase of c[t). 
Theorem 4 With probability one with respect to the Weiner process 



dt 



< W'\t) 



Proof: We denote the potential corresponding to the ith instrument by potential by <i>J, i.e. 

^\ = muc{t)) 

Using Ito's formula we can derive an equation for the differential d^\: 



d^l 



'^cp\R^^c{t))+a'{t)^\B},,c{t)) + \\^{V^\t))A 



JV 



dt + dRl<t,'{Rlc{t)) 



(11) 



We sum Equation pT|) over all instruments. As c{t) is chosen so that the average potential is constant, 
the differential of the average potential is zero. We thus get: 



N 



= 



(12) 



1=1 

N 

E 



dc{t) 
dt 



iRlcit)) + Uf2(b'''it)A ^"{R'ucit)) 



N 



dt + J2dRl<l^'{Rlc{t)) (13) 



From Equation ((4]) we know that the last term is equal to zero. Removing this term and reorganizing the 
equation we arrive at an expression for the rate of change of c(t) : 



dc{t) Ef=i {EUib-'Ht)?) r{Rlc{t)) 



dt 2j:l,nRlc{t)) 
we plug in the definitions of F*(i),0^ and (/>' to get: 

dc{t) E,;«j>o^iO (4 + f^) exp (^) 



dt 



^ 2^i-R\>Q TiFj^ y 2c(t) J 



Multiplying the enumerator and denominator by c{t), using the bound V'^{t) < V^'^ and denoting xt 
Rl/y^c{t) we get the inequality 



dt 



(14) 



The maximum of the ratio on the right hand side under the constraint (1/A^) X^i a; >o e^'^^ = e is achieved 
when Xi — \/2 for all i. Plugging this value back into equation (rcfcqn:final) yields the statement of the 
theorem. ■ 
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5 references 



There are many good sources for stochastic differential equations and the Ito calculus. One which 1 found 
particularly appealing is a set of lecture notes for a course on "Stochastic Calculus, Filtering, and Stochastic 
Control" by Ramon van Handel, available from the web here: 
http : //www . princeton . ed.u/~rvcin/acm217/ACM217 . pdf 
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